NOVEL POLYPEPTIDE

CROSS REFERENCE TO RELATED APPLICATION 
This application claims the benefit of U.K. Provisional Application Nos. 
0030854.4, filed December 18, 2000, and 0111031.1, filed May 4, 2001, as well as U.S.
Provisional Application Nos. 60/260,590, filed January 9, 2001, and 60/296,660, filed 
June 7, 2001. 

Technical field 


The present invention relates to a novel polynucleotide sequence which encodes a novel 
polypeptide belonging to the class of proteins known as G-protein coupled receptors 
(GPCRs). The present invention also relates, inter alia, to processes for producing the 
polypeptide and its uses. 


Background of the invention 

Cells and tissues respond to a wide variety of extracellular signalling molecules through
the interaction of these molecules with specific cell-surface receptors. One such class of
receptors is known as G-protein coupled receptors (GPCRs), and these are characterised
by a series of seven hydrophobic transmembrane segments. Upon binding of an
extracellular ligand to its receptor, intracellular signals are initiated via interactions with
heterotrimeric G proteins, which in turn can lead to a number of different intracellular
events depending upon which receptor has been activated. For example, some GPCRs
influence adenyl cyclase activity whereas others act via phospholipase C.

Members of the GPCR superfamily respond to a wide variety of ligands including small
molecule amines (such as serotonin, dopamine, acetylcholine), purines and pyrimidines
(such as ATP, ADP, adenosine, UTP, UDP), lipid-derived mediators (such as LPA),
amino acid derivatives (such as glutamate) and neurotransmitter peptides and hormones
(such as neurokinin, galanin, glucagon, gastrin). Although GPCRs are activated by a
broad range of ligands, it should be noted that individual GPCRs have a small and very
specific repertoire of ligands. Based upon an analysis of the primary structure of a novel
GPCR, it is now possible to classify it into a specific sub-family, thereby narrowing
the range of potential ligands.

In many cases, the endogenous ligands of GPCRs are relatively small, enabling them to be
mimicked or blocked by synthetic analogues. For example, drugs such as prazosin,
doxazosin, cimetidine and ranitidine are all effective antagonists of their respective target
GPCRs. Thus, as the activation or inhibition of GPCRs can have therapeutic
consequences, there is a continued need to provide new GPCRs and their associated
agonists and antagonists.

There are several diverse families of receptors which respond to purines and pyrimidines.
Examples of members of the GPCR family of such receptors are the adenosine receptors
designated A1, A2a, A2b, and A3; and several of the P2Y receptors which are stimulated
by UDP, UTP, ADP, and ATP.

Summary of the invention 

According to one aspect of the present invention, there is provided an isolated
polynucleotide comprising:

(a) a polynucleotide encoding the polypeptide as set forth in SEQ ID NO: 2;

(b) a polynucleotide encoding the polypeptide expressed by the DNA contained in
National Collection of Industrial, Food and Marine Bacteria Limited (NCIMB)
Deposit No. 41101;

(c) a polynucleotide comprising a nucleotide sequence of SEQ ID NO: 1;

(d) a polynucleotide comprising a nucleotide sequence that has at least 70-75%
identity to the polynucleotide of any one of (a) to (c);

(e) a polynucleotide comprising a nucleotide sequence which is capable of
hybridising to the polynucleotide of any one of (a) to (d); or

(f) a polynucleotide fragment of the polynucleotide of any one of (a) to (e).

Preferably, the polynucleotide comprises a nucleotide sequence that has at least 75-80%
identity to the polynucleotide of any one of (a) to (c) above. More preferably, the
polynucleotide comprises a nucleotide sequence that has at least 80-85% identity to the
polynucleotide of any one of (a) to (c) above. Even more preferably, the polynucleotide
comprises a nucleotide sequence that has at least 85-90% identity to the polynucleotide of
any one of (a) to (c) above. Yet more preferably, the polynucleotide comprises a
nucleotide sequence that has at least 90-95% identity to the polynucleotide of any one of
(a) to (c) above. Most preferably, the polynucleotide comprises a nucleotide sequence
that has greater than 95% identity to the polynucleotide of any one of (a) to (c) above.

Preferably, the polynucleotide encodes a mature polypeptide encoded by the DNA 
contained in NCIMB Deposit No. 41101. 

The polynucleotide described above preferably encodes a G-protein coupled receptor
(GPCR).

The present invention also provides a polynucleotide probe or primer comprising at least
15 contiguous nucleotides of the polynucleotide described above. The invention also
provides antisense oligonucleotides which hybridise to the polynucleotide of SEQ ID NO:
1 and allelic variants thereof and can be used to modify PFI-019 expression. The
invention also includes ribozymes which contain portions of sequence capable of
hybridising to the polynucleotide of SEQ ID NO: 1.

The present invention yet further provides a vector comprising the polynucleotide
described above.

According to a further aspect of the present invention, there is provided a host cell 
transformed or transfected with the vector described above. Preferably, the host cell is a 
mammalian, bacterial or yeast cell. 


According to yet a further aspect of the present invention, there is provided a process for
producing a polypeptide or fragment thereof comprising culturing said host cell under
conditions sufficient for the expression of said polypeptide or fragment. Preferably, said
polypeptide or fragment is expressed at the surface of said cell. The process preferably
further includes recovering the polypeptide or fragment from the culture.

There is also provided by the present invention a process for producing cells capable of
expressing a polypeptide or fragment thereof comprising transforming or transfecting cells
with the vector described above.

According to a further embodiment of the present invention, there are provided cells
produced by the process described above. There is also provided a membrane preparation
of said cells.

According to another aspect of the present invention, there is provided a polypeptide
comprising:

(a) a polypeptide having the deduced amino acid sequence translated from the
polynucleotide sequence in SEQ ID NO: 1 and variants, fragments, homologues,
analogues and derivatives thereof;

(b) a polypeptide of SEQ ID NO: 2 and variants, fragments, homologues, analogues
and derivatives thereof; or

(c) a polypeptide encoded by the cDNA of NCIMB Deposit No. 41101 and variants,
fragments, homologues, analogues and derivatives of said polypeptide.

There is also provided by the present invention an antibody against the polypeptide 
described above. 


The present invention yet further provides a compound, which activates the polypeptide 
described above (an agonist) or which inhibits activation of the polypeptide described 
above (an antagonist). Preferably, such compounds are nucleotides or derivatives thereof. 

According to another aspect of the invention, there is provided a method for identifying a
compound which binds to the polypeptide of the invention, comprising:

(a) contacting (i) a detectable compound A, preferably a labelled derivative of a
purinoceptor ligand known to bind to the polypeptide of the invention, more
preferably a nucleotide derivative, even more preferably 2-chloro-ATP,
2-methyl-thio-ATP, or 2-methyl-thio-ADP, and (ii) a test compound (or mixture of test
compounds), with cells expressing the polypeptide of the invention or a membrane
preparation of such cells;

(b) contacting the same amount of detectable compound A with the same amount of cells
expressing the polypeptide of the invention or a membrane preparation of such cells as
in step (a) under the same conditions as in step (a) but in the absence of test
compound;

(c) comparing the amount of compound A bound in steps (a) and (b) thereby identifying a
test compound (or mixture of test compounds) that competes with or blocks the
binding of compound A to the polypeptide of the invention.
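
The comparison in step (c) can be illustrated numerically. The following is a minimal Python sketch, not part of the described method: the function names, the example counts and the 50% cut-off are all hypothetical and chosen purely for illustration. It expresses the amount of compound A bound in the presence of a test compound as a percentage displacement relative to the control of step (b).

```python
def percent_displacement(bound_with_test: float, bound_without_test: float) -> float:
    """Percentage reduction in bound compound A caused by the test compound.

    bound_with_test    -- amount of compound A bound in step (a)
    bound_without_test -- amount of compound A bound in step (b), the control
    """
    if bound_without_test <= 0:
        raise ValueError("control binding must be positive")
    return 100.0 * (1.0 - bound_with_test / bound_without_test)


def competes(bound_with_test: float, bound_without_test: float,
             threshold: float = 50.0) -> bool:
    """Flag a test compound as a competitor.

    The 50% threshold is an arbitrary illustrative assumption.
    """
    return percent_displacement(bound_with_test, bound_without_test) >= threshold


# Hypothetical readings (e.g. counts per minute of a labelled compound A).
control_signal = 12000.0   # step (b): no test compound
test_signal = 3000.0       # step (a): test compound present

print(percent_displacement(test_signal, control_signal))
print(competes(test_signal, control_signal))
```

In practice the readings would first be corrected for non-specific binding; that correction is omitted here for brevity.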

According to another aspect of the present invention, there is provided a method for
identifying a compound which binds to and activates the polypeptide described above
comprising:

(a) contacting a compound with cells expressing on the surface thereof the polypeptide or
a membrane preparation of said cells, said polypeptide being associated with a second
component capable of providing a detectable signal in response to the binding of a
compound to said polypeptide; said contacting being under conditions sufficient to
permit binding of compounds to the polypeptide; and

(b) identifying a compound capable of polypeptide binding by detecting the signal
produced by said second component.

According to another aspect of the present invention, there is provided a method for
identifying a compound which binds to and inhibits activation of the polypeptide
described above comprising:

(a) contacting (i) a detectable first component known to bind to and activate the
polypeptide and (ii) a compound with cells expressing on the surface thereof the
polypeptide or a membrane preparation of said cells, said polypeptide being
associated with a second component capable of providing a detectable signal in
response to the binding of a compound to said polypeptide; said contacting being
under conditions sufficient to permit binding of compounds to the polypeptide; and

(b) determining whether the first component binds to the polypeptide by detecting the
absence or otherwise of a signal generated from the interaction of the first component
with the polypeptide.

As GPCRs are involved in signal transduction, agonists or antagonists of the polypeptide
of the present invention can find use in interfering in the signal transduction process.
Consequently, the present invention provides a compound which activates the
polypeptide described above (an agonist) or which inhibits activation of the polypeptide
described above (an antagonist) for use as a pharmaceutical. Such compounds, which can
act as agonists or antagonists of the polypeptide, can therefore find use in the therapeutic
areas which concern aspects of signal transduction. Therapeutically useful areas include,
but are not limited to, neurological disease, psychotherapeutics, urogenital disease,
reproduction and sexual medicine, inflammation, cancer, tissue repair, dermatology, skin
pigmentation, photoageing, frailty, osteoporosis, metabolic disease, cardiovascular
disease, gastrointestinal disease, anti-infection, allergy and respiratory disease, sensory
organ disorders, sleep disorders and hair loss. Preferably, therapeutically useful areas are
hypertension, asthma, and atherosclerosis.

Accordingly, there is also provided the use of the above compound (agonist) in the 
manufacture of a medicament in the treatment of a patient having need to activate a 
receptor. 


There is also provided the use of the above compound (antagonist) in the manufacture of a 
medicament in the treatment of a patient having need to inhibit a receptor. 

According to yet a further aspect of the invention, there is provided a method for the
treatment of a patient having need to activate a receptor comprising administering to the
patient a therapeutically effective amount of the above-described compound (agonist).
Preferably, said compound (agonist) is a polypeptide and a therapeutically effective
amount of the compound is administered by providing to the patient DNA encoding said
compound and expressing said compound in vivo.

According to yet a further aspect of the invention, there is also provided a method for the
treatment of a patient having need to inhibit a receptor comprising administering to the
patient a therapeutically effective amount of the above-described compound (antagonist).
Preferably, said compound (antagonist) is a polypeptide and a therapeutically effective
amount of the compound is administered by providing to the patient DNA encoding said
compound and expressing said compound in vivo.

There is also provided by the present invention a method for the treatment of a patient 
having need to activate or inhibit a receptor, comprising administering to the patient a 
therapeutically effective amount of the antibody described above. 

Yet further provided by the present invention is use of the antibody described above in the
manufacture of a medicament for the treatment of a patient having need to activate or
inhibit a receptor.

According to a further aspect of the present invention, there is provided a method of
treatment of a patient having need to upregulate a receptor, comprising administering to
the patient a therapeutically effective amount of the polypeptide of the present invention.
Preferably, said therapeutically effective amount of the polypeptide is administered by
providing to the patient DNA encoding said polypeptide and expressing said polypeptide
in vivo.

There is also provided by the present invention, use of the polypeptide in the manufacture 
of a medicament for the treatment of a patient having need to upregulate a receptor. 

According to yet a further aspect of the present invention, there are provided cells or an
animal genetically engineered to overexpress, underexpress or to exhibit targeted deletion
of the polypeptide of the present invention.

Another aspect of the invention is a method of elucidating the three-dimensional structure
of the polypeptide of the invention, comprising the steps of: (a) purifying the polypeptide;
(b) crystallising it; and (c) elucidating the structure, in particular by X-ray crystallography.

Yet a further embodiment of the invention is a method of modelling the structure of the
polypeptide of the invention, comprising the steps of: (a) aligning the sequence with a
sequence of a protein of known three-dimensional structure, in particular rhodopsin; (b)
mapping the detected sequence differences of the polypeptide of the invention onto the
known structure; and (c) deriving a homology model of the polypeptide of the invention.

Detailed description of the invention 

The polynucleotide which encodes the GPCR of the present invention was identified
electronically and analysed using various bioinformatic tools. The GPCR encoded by the
sequences described herein has been termed PFI-019.

The term "nucleotide sequence" as used herein refers to an oligonucleotide sequence or
polynucleotide sequence, and variants, homologues, fragments and derivatives thereof
(such as portions thereof). The nucleotide sequence may be DNA or RNA of genomic or
synthetic or recombinant origin, which may be double-stranded or single-stranded
whether representing the sense or antisense strand.

Preferably, the term "nucleotide sequence" means DNA. More preferably, the term
"nucleotide sequence" means DNA prepared by use of recombinant DNA techniques (i.e.
recombinant DNA).

As used herein "amino acid sequence" refers to peptide or protein sequences or portions 
thereof. 

The present invention does not cover the native PFI-019 when it is in its natural
environment and when it has been expressed by its native nucleotide coding sequence
which is also in its natural environment and when that nucleotide sequence is under the
control of its native promoter which is also in its natural environment, nor does it cover its
native nucleotide coding sequence in its natural environment under the control of its
native promoter in its natural environment.

As used herein "biologically active" refers to a PFI-019 having structural, regulatory or
biochemical functions of the naturally occurring PFI-019.

As used herein, "immunological activity" is defined as the capability of the natural, 
recombinant or synthetic PFI-019 or any oligopeptide thereof, to induce a specific 
immune response in appropriate animals or cells and to bind with specific antibodies. 


The term "antibody" includes polyclonal antibodies, monoclonal antibodies, antibody
fragments produced by proteolytic digestion of whole antibody molecules, such as Fab or
F(ab')2 fragments, as well as antibody fragments selected from expression libraries of Fab
or single-chain Fv fragments. As the skilled person will be well aware, antibodies can be
generated in animals such as mice, rats, rabbits, goats, sheep, etc., by immunising the
animal with the polypeptide or oligopeptides selected from the sequence of the
polypeptide. If such oligopeptides are used, they are often coupled to carrier proteins - all
methods the skilled person will be familiar with.

Monoclonal antibodies may be prepared using any technique which provides for the
production of antibody molecules by continuous cell lines in culture. These include, but
are not limited to, the hybridoma technique originally described by Koehler and Milstein
(1975, Nature 256, 495-497), the human B-cell hybridoma technique (Kosbor et al. (1983)
Immunol Today 4, 72; Cote et al. (1983) Proc. Natl. Acad. Sci. (USA) 80, 2026-2030) and
the EBV-hybridoma technique (Cole et al. (1985) Monoclonal Antibodies and Cancer
Therapy, Alan R Liss Inc, pp. 77-96). In addition, techniques developed for the
production of "chimeric antibodies", i.e. the splicing of mouse antibody genes to human
antibody genes to obtain a molecule with appropriate antigen specificity and biological
activity, can be used (Morrison et al. (1984) Proc. Natl. Acad. Sci. (USA) 81, 6851-6855;
Neuberger et al. (1984) Nature 312, 604-608; Takeda et al. (1985) Nature 314, 452-454).
Alternatively, techniques described for the production of single chain antibodies
(US-A-4946779) can be adapted to produce polypeptide-specific single chain antibodies.

Antibodies may also be produced by inducing in vivo production in the lymphocyte
population or by screening recombinant immunoglobulin libraries or panels of highly
specific binding reagents as disclosed in Orlandi et al. (1989, Proc. Natl. Acad. Sci.
(USA) Vol 86 p 3833-3837), and Winter G & Milstein C (1991; Nature 349 p293-299).

The term "derivative" as used herein includes chemical modification of a PFI-019. 

As used herein, the terms "isolated" and "purified" refer to molecules, either nucleic acids
or polypeptides/proteins, that are removed from their natural environment and isolated or
separated from at least one other component with which they are naturally associated. For
example, for nucleic acid sequences, the nucleic acid must be separated from at least one
of the genes with which it is naturally associated.

There are many methods for purifying proteins known to the skilled person, which can be
applied to purification of the PFI-019 protein. Often a convenient method involves
engineering the cDNA to introduce a sequence encoding a peptide tag, e.g. a hexa-His tag
or a Flag peptide tag, either at the 5' end just after the ATG initiation codon, or at the
C-terminus before the stop codon, so that the expressed protein will be tagged and can be
purified e.g. on a Ni2+ chelating column if a hexa-His tag is used, or using commercially
available anti-Flag peptide antibodies, e.g. for immunoprecipitation or affinity
chromatography techniques. Expression vectors engineered to contain such tags are
commercially available, and such methods are well known to the skilled person.
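
The two tag positions mentioned above can be illustrated at the sequence level. The following is a minimal Python sketch under stated assumptions: the function names are hypothetical, the toy coding sequence is invented for illustration and is not SEQ ID NO: 1, and CAT is used as the histidine codon although CAC would serve equally well.

```python
HIS6_CODONS = "CAT" * 6               # six histidine codons (CAT encodes His)
STOP_CODONS = {"TAA", "TAG", "TGA"}   # standard DNA stop codons


def add_n_terminal_tag(cds: str, tag: str = HIS6_CODONS) -> str:
    """Insert the tag codons directly after the ATG initiation codon."""
    cds = cds.upper()
    if not cds.startswith("ATG"):
        raise ValueError("coding sequence must start with ATG")
    return cds[:3] + tag + cds[3:]


def add_c_terminal_tag(cds: str, tag: str = HIS6_CODONS) -> str:
    """Insert the tag codons directly before the stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0 or cds[-3:] not in STOP_CODONS:
        raise ValueError("coding sequence must be in frame and end with a stop codon")
    return cds[:-3] + tag + cds[-3:]


# Toy open reading frame (Met-Ala-Lys-stop), purely illustrative.
toy_cds = "ATGGCTAAATGA"
print(add_n_terminal_tag(toy_cds))
print(add_c_terminal_tag(toy_cds))
```

Because the tag length is a multiple of three, both insertions keep the downstream codons in frame.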

The invention also encompasses purifying and crystallising the polypeptide, optionally
followed by elucidating the three-dimensional structure, preferably by X-ray
crystallography. The invention also encompasses deriving a homology model of the
three-dimensional structure of the polypeptide of the present invention.

Once the protein is purified, crystals may be obtained with methods similar to those
described by Palczewski et al. in Science 289, 739-745 (2000), and the structure can then
be solved by X-ray crystallography as described in this publication, or other biophysical
techniques. Alternatively, or additionally, the three-dimensional structure of the
polypeptide of the invention can also be modelled by homology modelling, comprising
the steps of aligning the sequence of the polypeptide of the invention with the sequence of
a similar polypeptide of known structure, preferably rhodopsin, and mapping the sequence
differences onto the known structure, thereby deriving a model for the three-dimensional
structure of the polypeptide of the invention. The three-dimensional structure, derived
either by structure determination or by homology modelling, can then be used for
designing compounds that may bind to the polypeptide of the invention, or for predicting
whether compounds will bind to it.

The terms "variant", "homologue" or "fragment" in relation to the amino acid sequence
for the preferred polypeptide of the present invention include any substitution of, variation
of, modification of, replacement of, deletion of or addition of one (or more) amino acid
from or to the sequence providing the resultant polypeptide has PFI-019 activity. In
particular, the term "homologue" covers homology with respect to structure and/or
function.

The terms "variant", "homologue" or "fragment" in relation to the nucleotide sequence
coding for the preferred polypeptide of the present invention include any substitution of,
variation of, modification of, replacement of, deletion of or addition of one (or more)
nucleic acid from or to the sequence providing the resultant nucleotide sequence codes for
or is capable of coding for a polypeptide having PFI-019 activity. In particular, the term
"homologue" covers homology with respect to structure and/or function providing the
resultant nucleotide sequence codes for or is capable of coding for a receptor having
PFI-019 activity. With respect to sequence homology, preferably there is at least 70-75%,
more preferably at least 75-80%, more preferably at least 80-85%, more preferably
85-90%, yet more preferably 90-95%, and most preferably greater than 95% identity to the
polynucleotide sequence shown in SEQ ID NO: 1.

In particular, "homology" as used herein can be determined by commercially available
computer programs that produce an optimal alignment between two sequences and then
calculate % homology (i.e. when comparing protein sequences, also scoring for
conservative substitutions, such as a change between Lysine and Arginine, for which the
software will use standard scoring matrices) and % identity (i.e. only counting identical
residues) between the aligned sequences. Typical examples of such computer programs
are GAP and BESTFIT, which are part of the GCG suite of programs (Devereux et al.
(1984) Nucl. Acids Res. 12, 387; Wisconsin Package Version 10, Genetics Computer
Group, Madison, Wisconsin), or ClustalW (Thompson, J.D. et al. (1994) Nucl. Acids Res.
22, 4673-80) which can also be used for multiple sequence alignments.
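
The distinction between % identity and % homology can be illustrated with a short calculation on two sequences that have already been aligned. The following is a minimal Python sketch, not the algorithm of GAP, BESTFIT or ClustalW: the small table of conservative residue groups is a simplified assumption standing in for a standard scoring matrix, the choice of the alignment length as denominator is one convention among several, and the two example sequences are invented.

```python
# Simplified conservative groups (an assumption; real programs use scoring matrices).
CONSERVATIVE_GROUPS = [set("KR"), set("DE"), set("ILVM"),
                       set("FYW"), set("ST"), set("NQ")]


def is_conservative(a: str, b: str) -> bool:
    """True if residues a and b fall within the same simplified group."""
    return any(a in group and b in group for group in CONSERVATIVE_GROUPS)


def identity_and_homology(aligned1: str, aligned2: str) -> tuple[float, float]:
    """Return (% identity, % homology) for two aligned sequences.

    Gaps are written as '-'. Percentages are taken over the alignment length.
    """
    if len(aligned1) != len(aligned2) or not aligned1:
        raise ValueError("sequences must be aligned to the same non-zero length")
    identical = similar = 0
    for a, b in zip(aligned1.upper(), aligned2.upper()):
        if a == "-" or b == "-":
            continue                      # gap columns score nothing
        if a == b:
            identical += 1
            similar += 1                  # identical residues also count as similar
        elif is_conservative(a, b):
            similar += 1
    length = len(aligned1)
    return 100.0 * identical / length, 100.0 * similar / length


# Invented 8-column alignment: 5 identical, 2 conservative (K/R, L/I), 1 gap.
print(identity_and_homology("MKTLLV-A", "MRTLIVGA"))
```

With this convention the % homology figure is always at least as large as the % identity figure, since every identical position is also scored as similar.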


As used herein, the terms "variant", "homologue", "fragment" and "derivative" also 
include allelic variations of the sequences. 

The term "variant" also encompasses sequences that are complementary to sequences that
are capable of hybridising to the nucleotide sequences presented herein. Preferably, the
term "variant" encompasses sequences that are complementary to sequences that are
capable of hybridising under conditions of medium to high stringency (e.g. 55-65°C and
0.1xSSC (1xSSC = 0.15 M NaCl, 0.015 M Na3 citrate, pH 7.0)) to the nucleotide sequences
presented herein.
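
For orientation, the salt concentrations implied by a given SSC strength follow directly from the 1xSSC definition above. The following is a minimal Python sketch of that arithmetic; the function names are hypothetical, and the 20x stock strength used in the dilution helper is an assumption (a commonly used laboratory stock) rather than something stated in this specification.

```python
# Composition of 1xSSC as defined in the text (molar concentrations).
SSC_1X_MOLAR = {"NaCl": 0.15, "Na3 citrate": 0.015}


def ssc_molarity(fold: float) -> dict[str, float]:
    """Molar concentration of each salt in SSC of the given strength."""
    return {salt: fold * molar for salt, molar in SSC_1X_MOLAR.items()}


def stock_volume_ml(final_volume_ml: float, target_fold: float,
                    stock_fold: float = 20.0) -> float:
    """Volume of stock needed to prepare final_volume_ml at target_fold.

    stock_fold defaults to 20x, which is an assumed stock strength.
    """
    if target_fold > stock_fold:
        raise ValueError("target strength cannot exceed stock strength")
    return final_volume_ml * target_fold / stock_fold


print(ssc_molarity(0.1))               # salt concentrations in 0.1xSSC
print(stock_volume_ml(1000.0, 0.1))    # stock volume for 1 litre of 0.1xSSC
```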


The present invention also covers nucleotide sequences that can hybridise to the
nucleotide sequences of the present invention (including complementary sequences of
those presented herein). In a preferred aspect, the present invention covers nucleotide
sequences that can hybridise to the nucleotide sequence of the present invention under
conditions of medium to high stringency (e.g. 55-65°C and 0.1xSSC). Such
polynucleotides or oligonucleotides may be used as probes, or for amplifying all or part of
the sequence of the invention when used as PCR primers. These sequences may also be
used to modulate the expression of PFI-019 through antisense techniques or the use of
ribozymes. Antisense nucleic acids, preferably oligonucleotides about 10 to 30 bases long,
capable of specifically binding to the PFI-019 mRNA transcript, i.e. complementary to the
sequence in SEQ ID NO: 1, are introduced into cells by standard techniques (e.g. using
liposomes), bind to the target nucleotide sequences in the cells and thereby prevent
transcription and/or translation of the target sequence. The antisense oligonucleotides are
often made more stable by modifications such as using phosphorothioate or
methylphosphonate oligonucleotides.
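
The relationship between a coding (sense) sequence and an antisense oligonucleotide complementary to it can be shown with a short reverse-complement calculation. The following is a minimal Python sketch: the function name is hypothetical, the example sequence is invented and is not SEQ ID NO: 1, and no attempt is made to model target accessibility, specificity or chemical modification.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def antisense_oligo(sense: str, start: int, length: int = 20) -> str:
    """Return a DNA oligonucleotide complementary to sense[start:start+length].

    The result is written 5'->3', i.e. it is the reverse complement of the
    chosen window. Length is restricted to the 10-30 base range given in the text.
    """
    if not 10 <= length <= 30:
        raise ValueError("length should be between 10 and 30 bases")
    window = sense.upper().replace("U", "T")[start:start + length]
    if len(window) != length:
        raise ValueError("window extends beyond the end of the sequence")
    return "".join(COMPLEMENT[base] for base in reversed(window))


# Invented sense-strand fragment, for illustration only.
toy_sense = "ATGGCTAAAGGTCCTGAACTG"
print(antisense_oligo(toy_sense, start=0, length=12))
```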

Antisense sequences can also be incorporated into ribozymes such as hammerhead or
hairpin ribozymes. These can also be introduced into cells and are thought to cleave the
specific transcripts and thereby prevent their translation. Such ribozymes can be
introduced into cells by gene therapy approaches, or by standard techniques, e.g. using
viral vectors or liposomes. They may also be modified chemically to increase their
stability to nuclease digestion.

Details about antisense and ribozyme technologies can be found in textbooks such as L
Gibson (Ed.) Antisense and Ribozyme Methodology, Chapman & Hall; R.
Schlingensiepen (1997) Antisense - From Technology to Therapy: Lab Manual and
Textbook, Blackwell Science Inc.; P.C. Turner (Ed.) (1997) Ribozyme Protocols,
Humana Press.

The term "vector" includes expression vectors and transformation vectors. The term
"expression vector" means a construct capable of in vivo or in vitro expression. The term
"transformation vector" means a construct capable of being transferred from one species
to another.

The term "purinoceptor ligand" refers to a known ligand of the family of purinoceptors
such as P2Y1 receptors. Examples of such ligands include 2-chloro-ATP,
2-methyl-thio-ATP or 2-methyl-thio-ADP.

Methods for obtaining transgenic animals can be found in I.J. Jackson, C.M. Abbott
(Eds) (2000) Mouse Genetics and Transgenics: A Practical Approach, Oxford University
Press, and in M.J. Tymms & I. Kola (Eds) (2001) Gene Knockout Protocols (Methods in
Molecular Biology, Vol 158), Humana Press.

Methods for gene therapy approaches are covered in T.F. Kresina (2000) An Introduction 
to Molecular Medicine and Gene Therapy; John Wiley & Sons, and in T. Friedmann (Ed.) 
(1998) The Development of Human Gene Therapy (Cold Spring Harbor Monograph 
Series, 36), Cold Spring Harbor Laboratory. 


For human use, the compounds of the invention, and their pharmaceutically acceptable
salts, can be administered alone but will generally be administered in admixture with a
suitable pharmaceutical excipient, diluent or carrier selected with regard to the intended
route of administration and standard pharmaceutical practice.

For example, the compounds, and their pharmaceutically acceptable salts, can be
administered orally, buccally or sublingually in the form of tablets, capsules, ovules,
elixirs, solutions or suspensions, which may contain flavouring or colouring agents, for
immediate-, delayed-, modified-, sustained-, pulsed- or controlled-release applications.

The compounds can also be administered parenterally, for example, intravenously,
intra-arterially, intraperitoneally, intrathecally, intraventricularly, intraurethrally, intrasternally,
intracranially, intramuscularly or subcutaneously, or they may be administered by infusion
or needleless injection techniques. For such parenteral administration they are best used
in the form of a sterile aqueous solution which may contain other substances, for example,
enough salts or glucose to make the solution isotonic with blood. The aqueous solutions
should be suitably buffered (preferably to a pH of from 3 to 9), if necessary. The
preparation of suitable parenteral formulations under sterile conditions is readily
accomplished by standard pharmaceutical techniques well-known to those skilled in the
art.

DEPOSITS

The following sample was deposited in accordance with the Budapest Treaty at the
recognised depositary The National Collections of Industrial, Food and Marine Bacteria
Limited (NCIMB) at 23 St. Machar Drive, Aberdeen, Scotland, AB24 3RY, United
Kingdom on 10 April 2001:

NCIMB number NCIMB 41101 is Escherichia coli Pfi-019.

The depositor was Pfizer Limited, Ramsgate Road, Sandwich, Kent, CT13 9NJ, United 
Kingdom. 


One skilled in the art could readily grow the above-mentioned E. coli clone (NCIMB 41101)
in Luria Broth containing ampicillin and isolate the plasmid DNA of the clone using the
alkaline lysis method as described in Sambrook et al., eds. (1989) Molecular Cloning: A
Laboratory Manual, Cold Spring Harbor Laboratory Press, New York, NY, USA. The
dideoxy termination method as described by Sanger et al. (Proc Natl Acad Sci (USA) (1977),
74, 5463-5467) and modified by Applied Biosystems (see Applied Biosystems
manufacturer's literature) for fluorescent detection could then be used to sequence the DNA
and identify PFI-019.

The present invention also encompasses sequences derivable and/or expressible from that
deposit and embodiments comprising the same. The present invention also encompasses
partial sequences derivable and/or expressible from that deposit and embodiments
comprising the same, wherein those partial sequences code for active polypeptides. The
present invention also encompasses proteins comprising sequences derivable and/or
expressible from that deposit and embodiments comprising the same. The present
invention also encompasses proteins comprising partial sequences derivable and/or
expressible from that deposit and embodiments comprising the same, wherein those partial
sequences code for active polypeptides.

EXAMPLES

The present invention will now be described, by way of example only, with reference to 
the accompanying Figures and Sequence Listing in which:- 

Figure 1 shows a flow diagram illustrating the bioinformatics analysis for the sequence of 
PFI-019. 

Figure 2 shows a ClustalW alignment of PFI-019 with the P2Y purinoceptor 1 (P2Y1). 

Figure 3 shows the results of a functional, cell-based assay, showing the stimulation of
PFI-019 by 2-chloro-ATP, using FLIPR® technology.

Figure 4 shows the stimulation of PFI-019 by 2-methyl-thio-ATP.

Figure 5 shows the stimulation of PFI-019 by 2-methyl-thio-ADP.

Figure 6 shows the stimulation of PFI-019 by UTP. 

SEQ ID NO: 1 shows the nucleotide sequence coding for PFI-019. 

SEQ ID NO: 2 shows the corresponding amino acid sequence of PFI-019.

SEQ ID NOS: 3 and 4 show the PCR primers used in the Examples. 

Example 1: The identification of PFI-019

PFI-019 was identified by searching the sequences in the Incyte database with known
members of the G-protein coupled receptor (GPCR) family using the BLAST algorithm.
To confirm that PFI-019 is a member of the GPCR family, a number of
bioinformatics analyses were performed, as shown in Fig. 1.

(a) BLAST Search against Swissprot 

PFI-019 was searched against Swissprot using the BLAST algorithm (Basic Local
Alignment Search Tool; Altschul SF (1993) J. Mol. Evol. 36:290-300; Altschul, SF et al.
(1990) J. Mol. Biol. 215:403-410) to identify the closest protein match. In this case the
top hit was to Swissprot accession number P47900, P2Y purinoceptor 1 (P2Y1).

These results indicate that PFI-019 is a member of the GPCR family. 

(b) ClustalW Alignment of PFI-019 with the P2Y purinoceptor 1 (P2Y1)

These results are shown in Figure 2. A star (*) underneath the sequence comparison
indicates identical residues in both sequences; a colon (:) indicates a conservative
difference between the two sequences (e.g. an Arginine residue in one sequence, with the
second sequence having a Lysine in the corresponding position); a point (.) underneath
indicates that both sequences have similar amino acids in this position (e.g. an Alanine in
one, a Valine in the second sequence). The assignment of these symbols is performed by
the software according to scoring matrices such as Blosum62, well known to the skilled
person.
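
The symbol assignment described above can be illustrated with a minimal Python sketch. It assumes the residue groupings conventionally used for the ClustalW consensus line ("strong" groups for the colon, "weak" groups for the point); the group lists and the function names are assumptions made for illustration and are not taken from the Figure or from the software used in this Example.

```python
# Sketch of pairwise consensus-symbol assignment in the style of ClustalW.
# ASSUMPTION: the residue groups below are the conventional ClustalW
# "strong" and "weak" conservation groups; they are illustrative only.
STRONG_GROUPS = ["STA", "NEQK", "NHQK", "NDEQ", "QHRK",
                 "MILV", "MILF", "HY", "FYW"]
WEAK_GROUPS = ["CSA", "ATV", "SAG", "STNK", "STPA", "SGND",
               "SNDEQK", "NDEQHK", "NEQHRK", "FVLIM", "HFY"]


def consensus_symbol(a, b):
    """Return '*', ':', '.' or ' ' for one aligned pair of residues."""
    a, b = a.upper(), b.upper()
    if a == "-" or b == "-":
        return " "                      # a gap is never marked as conserved
    if a == b:
        return "*"                      # identical residues
    if any(a in g and b in g for g in STRONG_GROUPS):
        return ":"                      # conservative difference
    if any(a in g and b in g for g in WEAK_GROUPS):
        return "."                      # similar residues
    return " "                          # no conservation


def consensus_line(seq1, seq2):
    """Build the symbol line printed underneath two aligned sequences."""
    if len(seq1) != len(seq2):
        raise ValueError("aligned sequences must have equal length")
    return "".join(consensus_symbol(a, b) for a, b in zip(seq1, seq2))
```

With these groups, an Arginine/Lysine pair yields a colon and an Alanine/Valine pair yields a point, matching the two examples given in the text.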


(c) BLAST search against a non-redundant human GPCR database 

PFI-019 was searched against a non-redundant human GPCR database comprising mainly
sequences from Genbank and Derwent Geneseq databases in order to identify the class of
potential agonists for this receptor. The top ten hits are shown below:







Hit                                                 Score   e-value

P2Y purinoceptor 1 (P2Y1) [L:373]                   223     2e-59
Uridine nucleotide receptor (UNR) [L: . . .         203     3e-53
P2U purinoceptor 2 (P2U2) (geneseqp) [L: . . .      187     2e-48
P2U purinoceptor 1 (P2U1) [L:377]                   187     2e-48
Cysteinyl Leukotriene receptor CysLT2 (P. . .       185     7e-48
P2Y purinoceptor 6 (P2Y6) [L:328]                   158     8e-40
G protein-coupled receptor GPR17 [L:339]            155     9e-39
CCR9 [receptor for CCL25 (TECK)] [L: . . .          151     1e-37
G-protein-coupled receptor (celera) . . .           150     3e-37
Thrombin receptor [L:425]                           150     3e-37

(e value = statistical likelihood of the hit occurring by chance)







These results demonstrate that PFI-019 is most similar to P2Y receptors and they suggest
that PFI-019 encodes a novel GPCR whose ligand is likely to be a nucleotide or a
nucleotide derivative.



It will be appreciated that the foregoing is provided by way of example only and that
modification of detail may be made without departing from the scope of the invention.

Example 2: Isolation of PFI-019

PFI-019 gene-specific primers (PFI-019 forward and PFI-019 reverse; SEQ ID
NOs: 3 and 4, respectively) were employed in a PCR to amplify the PFI-019 coding
region from human genomic DNA (Boehringer Mannheim), where the conditions were as
follows:-



PCR mix:

PFI-019 primers                                      1 µl (10 µM stock)
Human genomic DNA                                    2 µl (400 ng)
dNTPs (concentration as per kit)                     1 µl
Platinum Taq high fidelity Polymerase (LTI, Inc.)    0.5 µl
10x amplification Buffer (from PCR kit)              5 µl
MgSO4                                                1.5 µl
dH2O                                                 39 µl



PCR primers:

Forward Primer (= PFI-019 forward); SEQ ID NO: 3:
5'- ACC ATG AAT GAG CCA CTA GAC TAT TTA GCA AAT -3' (SEQ ID NO: 3)

Reverse Primer (= PFI-019 reverse); SEQ ID NO: 4:
5'- TCA AGG GTT GTT TGA GTA ACT AAT TTT CTT -3' (SEQ ID NO: 4)
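
As a quick consistency check on the primers listed above, the following Python sketch confirms that the forward primer carries an ATG start codon and that the reverse primer, read on the sense strand, ends in a stop codon. The helper names are hypothetical and the check is illustrative only; it says nothing about the PFI-019 sequence between the two primers.

```python
# Illustrative check of the two primers given in the text.
# Helper names (clean, reverse_complement) are hypothetical.
COMPLEMENT = str.maketrans("ACGT", "TGCA")
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean(primer):
    """Remove the codon spacing used in the text and upper-case the bases."""
    return primer.replace(" ", "").upper()


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.translate(COMPLEMENT)[::-1]


FORWARD = clean("ACC ATG AAT GAG CCA CTA GAC TAT TTA GCA AAT")   # SEQ ID NO: 3
REVERSE = clean("TCA AGG GTT GTT TGA GTA ACT AAT TTT CTT")       # SEQ ID NO: 4

# The reverse primer anneals to the sense strand, so its reverse complement
# is the 3' end of the amplified coding region as read 5' to 3'.
SENSE_3PRIME_END = reverse_complement(REVERSE)

HAS_START = FORWARD[3:6] == "ATG"                 # ATG follows the leading ACC
HAS_STOP = SENSE_3PRIME_END[-3:] in STOP_CODONS   # last codon on the sense strand
```

Both checks hold for the primers as printed: the amplified fragment begins at the ATG following the ACC triplet and ends in TGA.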

PCR cycle:

(1) 94°C 2 mins
(2) 94°C 30 seconds
(3) 54°C 30 seconds
(4) 68°C 2 mins
Steps (2) through to (4) were repeated for a further 27 cycles.
(5) 68°C 15 mins
(6) 4°C soak.
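
The reaction mix and cycling programme above can be written down as data and sanity-checked, as in the Python sketch below. Two assumptions are made and flagged in the comments: the primer volume is read as 1 µl, and "repeated for a further 27 cycles" is read as 28 passes through steps (2) to (4) in total. Ramp times and the final 4°C hold are not counted.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: int      # block temperature in degrees Celsius
    seconds: int     # hold time in seconds


INITIAL_DENATURATION = Step(94, 2 * 60)                 # step (1)
CYCLE = (Step(94, 30), Step(54, 30), Step(68, 2 * 60))  # steps (2) to (4)
FINAL_EXTENSION = Step(68, 15 * 60)                     # step (5)

# ASSUMPTION: one initial pass plus "a further 27 cycles" = 28 cycles in total.
N_CYCLES = 1 + 27

# Volumes in microlitres.
# ASSUMPTION: the primer volume is taken as 1 µl.
MIX_UL = {
    "primers": 1.0,
    "genomic_dna": 2.0,
    "dntps": 1.0,
    "polymerase": 0.5,
    "buffer_10x": 5.0,
    "mgso4": 1.5,
    "water": 39.0,
}


def programmed_seconds():
    """Total programmed hold time, excluding ramping and the 4 degree soak."""
    per_cycle = sum(step.seconds for step in CYCLE)
    return (INITIAL_DENATURATION.seconds
            + N_CYCLES * per_cycle
            + FINAL_EXTENSION.seconds)


def total_volume_ul():
    """Total reaction volume in microlitres."""
    return sum(MIX_UL.values())
```

Under these assumptions the components add up to a 50 µl reaction, which is consistent with 5 µl of 10x buffer, and the programmed hold time is 101 minutes.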
The PFI-019 PCR product was TOPO cloned (Invitrogen TOPO cloning methodology)
into the vector pcDNA4.1/His-Max-TOPO (Invitrogen), according to the manufacturer's
instructions. The resulting insert was subsequently sequence-verified on both strands
using ABI DNA sequencing methodology as per the manufacturer's protocol.


Example 3: Tissue distribution of PFI-019 

Electronic northern (i.e. analysis of EST sequences in databases) identifies an EST 
comprising PFI-019 DNA sequence in a colon cDNA library. 


Example 4: Functional cell-based assays for agonist activation of PFI-019 

Fluorescence Imaging Plate Reader (FLIPR®) technology was employed as a means to 
detect activation of PFI-019 by agonists in a cell-based assay. 


5 x 10⁶ Human Embryonic Kidney (HEK) 293 cells expressing the mouse Gα15 gene
(from here on called '293 cells') were transiently transfected with 7.5 µg of PFI-019
(contained within the pcDNA4HIS-max-TOPO (Invitrogen) plasmid vector), or vector
alone, using Lipofectamine Plus® reagent (Gibco BRL) as per the manufacturer's
protocol. The plasmid pcDNA4HIS-max-TOPO was used as it contains elements that
up-regulate the level of gene transcription over standard pcDNA3.1 vectors. 24 hrs
post-transfection, the cells were detached from the flask using Trypsin/EDTA solution (LTI)
and seeded into a black sided, Poly-D-lysine-treated, 96-well plate (Becton Dickinson) at
5 x 10⁴ cells/well density. The plates were left overnight to allow the cells to adhere to
the bottom of the wells. The medium was removed from the cells and replaced with 100
µl warm (37°C) dye loading solution (50 µg Fluo3 (Molecular Probes) in 20 µl DMSO +
20% pluronic acid in DMSO, added to 11 ml Dulbecco's Modified Eagle's Medium
containing 1x Probenecid (100x Probenecid: 0.71 g Probenecid was dissolved in 5 ml
1M NaOH and 5 ml Dulbecco's Phosphate Buffered Saline (PBS)), per plate; Probenecid
(Molecular Probes) inhibits activity of the anion transport protein, thus improving dye
loading). The plates were then incubated for 1 hr at 37°C. Plates were subsequently
washed 4 times with 250 µl of wash buffer per well (5 ml 100x Probenecid stock + 495 ml PBS,
pH 7.4). The plates were returned to the 37°C/5% CO2 incubator for 30 mins prior
to processing within the FLIPR® instrument. The FLIPR® processing involved reading
the fluorescence for all samples for 2 minutes; during this time the fluorescence baseline
was determined for the first 10 seconds. The desired amount of compound was then
automatically transferred to the wells, and the fluorescence was continuously monitored
for the remainder of the time. All compounds were diluted in wash buffer.
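
The read-out described above (a baseline taken over the first 10 seconds, followed by continuous monitoring after compound addition) can be reduced to a single number per well, as in the following Python sketch. The function name, the use of the mean as the baseline estimate and the peak-minus-baseline metric are assumptions for illustration; the instrument software may quantify the response differently.

```python
from statistics import mean


def peak_over_baseline(times_s, fluorescence, baseline_window_s=10.0):
    """Return the peak fluorescence after the baseline window, minus baseline.

    times_s           -- time of each reading, in seconds from the start
    fluorescence      -- fluorescence reading at each time point
    baseline_window_s -- length of the baseline period (10 s in the text)

    ASSUMPTION: the baseline is the mean of the readings taken before
    baseline_window_s, and the response is the maximum reading afterwards.
    """
    if len(times_s) != len(fluorescence):
        raise ValueError("times_s and fluorescence must have equal length")
    before = [f for t, f in zip(times_s, fluorescence) if t < baseline_window_s]
    after = [f for t, f in zip(times_s, fluorescence) if t >= baseline_window_s]
    if not before or not after:
        raise ValueError("need readings both before and after the baseline window")
    return max(after) - mean(before)
```

Applying the same function to receptor-transfected and vector-only wells gives two values that can be compared directly for each compound.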

Analysis of PFI-019 activation by various purinoceptor agonist compounds in a 
FLIPR® cell-based assay 


Using methodology as described in detail above, purinoceptor agonist compounds were
identified as being able to functionally activate PFI-019.

Figures 3, 4, 5 and 6 show the response of PFI-019-transfected 293 cells to
2-chloroadenosine triphosphate tetrasodium (2-chloro-ATP; Fig. 3), 2-methylthioadenosine
triphosphate tetrasodium (2-methyl-thio-ATP; Fig. 4), 2-methylthioadenosine
diphosphate trisodium (2-methyl-thio-ADP; Fig. 5) and uridine triphosphate (UTP; Fig. 6).
The graphs show fluorescence intensity versus time (in seconds); the black line shows the
response to the respective compound by PFI-019-expressing cells, whereas the grey line
shows the response by mock-transfected cells. All compounds were purchased from
Sigma. Vector-only transfected 293 cells gave no measurable response to these
compounds.

Example 5: Engineering of stable cell lines expressing high levels of PFI-019


A suitable host cell line, e.g. HEK293 cells or CHO cells (engineered to express a desired
G protein such as Gα15), is transfected as described in Example 4, using Lipofectamine
or electroporation, with a suitable mammalian cell expression vector containing the
cDNA (preferably without any 5' or 3' untranslated regions) encoding PFI-019, and
containing a selectable marker, e.g. a neomycin resistance gene. Following transfection,
selection pressure is applied, e.g. by adding 400-800 µg/ml G418 to the growth medium,
thereby killing all cells that have not taken up the vector containing the
neomycin resistance gene. After about 3-4 weeks of selection, individual clones are
picked and expanded for further analysis. The individual clones can be analysed e.g. by
Northern blot, using a labelled probe designed from the PFI-019 cDNA sequence.

Example 6: Ligand binding assays

Cells expressing PFI-019, either 24-72 hours after transient transfection as described in
Example 4, or engineered as described in Example 5, are harvested by scraping,
resuspended in 20 ml of ice-cold assay buffer (50 mM Tris-HCl pH 7.4) and homogenised,
and the resulting suspension is centrifuged at 20,000g, 4°C for 30 minutes. The
supernatant is decanted, and the pellet is resuspended in 3 ml of assay buffer (50 mM
Tris-HCl pH 7.4) and re-homogenised. The protein concentration is determined via
Bradford's assay (Biorad), according to the manufacturer's recommendations.

Aliquots of this membrane preparation containing 200 µg protein are then incubated with
various potential ligands, such as nucleotides or nucleotide analogues, radiolabelled to high
specific activity, for about 2 hrs at room temperature or at 30°C (the optimal conditions,
ion concentrations, incubation time and temperature need to be determined for each
ligand). To terminate incubations, samples are rapidly filtered using the Brandel cell
harvester onto Wallac Filtermats (Perkin Elmer), which have been previously soaked (for
1 h) in a 0.3% (v/v) solution of PEI (polyethylenimine; Sigma) in assay buffer to reduce
Filtermat binding. Immediately, the Filtermat/wells are washed four times in rapid
succession with 2 ml of assay buffer per well. Filtermats are dried using a microwave
oven, and Meltilex scintillant (Perkin Elmer) is melted onto the Filtermats using the
Wallac Meltilex heat sealer. The bound radioactivity on the Filtermats is determined
using the Wallac betaplate scintillation counter.

The specific binding is defined as the difference between the total radioactivity bound and
the radioactivity measured in the presence of an excess of unlabelled ligand.
Mock-transfected cells are also measured to assess whether the host cells endogenously
express receptors for the ligands used.
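
The definition of specific binding given above amounts to a simple subtraction, sketched below in Python. The function names and the use of counts per minute (cpm) as the unit are assumptions; the second function, which subtracts the specific binding seen in mock-transfected cells, is one possible way of using the mock control and is not prescribed by the text.

```python
def specific_binding(total_cpm, nonspecific_cpm):
    """Specific binding = total bound minus binding in excess unlabelled ligand.

    total_cpm       -- radioactivity bound with radioligand alone
    nonspecific_cpm -- radioactivity bound in the presence of an excess of
                       unlabelled ligand
    """
    return total_cpm - nonspecific_cpm


def receptor_attributable_binding(transfected_total, transfected_nonspecific,
                                  mock_total, mock_nonspecific):
    """ASSUMPTION: correct for endogenous receptors by subtracting the
    specific binding measured in mock-transfected cells."""
    transfected = specific_binding(transfected_total, transfected_nonspecific)
    mock = specific_binding(mock_total, mock_nonspecific)
    return transfected - mock
```

A value close to zero from the mock-transfected membranes indicates that the host cells contribute little endogenous binding for the ligand in question.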

Example 7: β-lactamase assay

A CHO cell line engineered to stably contain cyclic AMP response elements (CRE)
functionally linked to the coding region of reporter gene β-lactamase as well as the
nuclear factor of activated T-cell promoter NF-AT (Flanagan et al (1991) Nature 352,
803-807) linked to the coding region of reporter gene β-lactamase (CHO-CRE-NFAT-BLA)
is transfected stably as described in Example 5, with a plasmid containing the
cDNA encoding PFI-019 functionally linked to a promoter that drives expression in
mammalian cells, e.g. pcDNA3.1, and selected for stable expression of PFI-019.


The CHO-CRE-NFAT-BLA cells expressing PFI-019 are then seeded at 4 x 10³ cells per
well in 96-well plates, and incubated for 60 hours at 37°C in a CO2 incubator (5% CO2).
The medium is then removed, and 90 µl starvation medium (DMEM with high glucose,
0.1 mM Non-essential amino acids, 1 mM sodium pyruvate, 25 mM Hepes buffer, without
serum or antibiotics) is added to each well, and the cells are incubated overnight. The
cells are then stimulated by addition of 10 µl 2-chloro-ATP or 2-methyl-thio-ATP (or 1
µM ionomycin for positive control) prepared in DMEM with 1% dialysed fetal bovine
serum per well. Following incubation at 37°C/5% CO2 for 5 hours, 20 µl of 6x dye
solution (CCF2 Loading kit from Aurora, Cat # 00 100 012, contains solutions A-D; to
prepare 6x dye solution, 36 µl solution A (CCF2-AM), 180 µl solution B, 2.8 ml solution C
and 225 µl solution D are mixed according to the instructions) is added per well, and the
plate is incubated on a rocking platform in the dark at room temperature for 1 hour
(rocking at 40 cycles per minute). The fluorescence is then measured in a Cytofluor 4000
(PerSeptive Biosystems), using an excitation wavelength of 405 nm, and measuring
emission at wavelengths of 450 nm and 530 nm.

When the ligand stimulates the receptor and the response leads to either a change in
cAMP concentration or in calcium concentration in the cells, β-lactamase will be
expressed in the cells. The dye is composed of a blue (coumarin) and a green
(fluorescein) component which are linked by a β-lactam linker group. When excited at
405 nm, fluorescence energy transfer will occur within the uncleaved molecule, and the
emission wavelength will be green (around 530 nm). When the linker is cleaved by
β-lactamase, no energy transfer can occur, and blue fluorescence results, measured at 450
nm. Measuring the ratio of blue to green fluorescence will give an indication of receptor
stimulation. The ratio is agonist dose dependent, and can be used to rank agonists for the
receptor.
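
The ratiometric read-out described above can be sketched in Python as follows. The optional background subtraction (for example from cell-free wells) and the function names are assumptions added for illustration; the text itself specifies only the ratio of emission at 450 nm to emission at 530 nm.

```python
def blue_green_ratio(em450, em530, bg450=0.0, bg530=0.0):
    """Ratio of blue (450 nm) to green (530 nm) emission for one well.

    ASSUMPTION: bg450 and bg530 are optional background readings that are
    subtracted before the ratio is taken; they default to zero.
    """
    blue = em450 - bg450
    green = em530 - bg530
    if green <= 0:
        raise ValueError("green signal must be positive after background subtraction")
    return blue / green


def rank_by_ratio(ratios):
    """Order agonist names from highest to lowest blue/green ratio.

    ratios -- mapping of agonist name to its blue/green ratio, measured at
              a common concentration
    """
    return sorted(ratios, key=ratios.get, reverse=True)
```

A higher ratio corresponds to more cleaved dye and therefore to stronger reporter expression, so ranking by ratio at a fixed concentration gives a first-pass ordering of agonists.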



